Oligonucleotide frequency matrices addressed to recognizing functional DNA sites

نویسندگان

  • Mikhail P. Ponomarenko
  • Julia V. Ponomarenko
  • Anatoly S. Frolov
  • Olga A. Podkolodnaya
  • Denis G. Vorobiev
  • Nikolay A. Kolchanov
  • G. Christian Overton
چکیده

MOTIVATION Recognition of functional sites remains a key event in the course of genomic DNA annotation. It is well known that a number of sites have their own specific oligonucleotide content. This pinpoints the fact that the preference of the site-specific nucleotide combinations at adjacent positions within an analyzed functional site could be informative for this site recognition. Hence, Web-available resources describing the site-specific oligonucleotide content of the functional DNA sites and applying the above approach for site recognition are needed. However, they have been poorly developed up to now. RESULTS To describe the specific oligonucleotide content of the functional DNA sites, we introduce the oligonucleotide alphabets, out of which the frequency matrix for a given site could be constructed in addition to a traditional nucleotide frequency matrix. Thus, site recognition accuracy increases. This approach was implemented in the activated MATRIX database accumulating oligonucleotide frequency matrices of the functional DNA sites. We have demonstrated that the false-positive error of the functional site recognition decreases if the oligonucleotide frequency matrixes are added to the nucleotide frequency matrixes commonly used. AVAILABILITY The MATRIX database is available on the Web, http://wwwmgs.bionet.nsc.ru/Dbases/MATRIX/ and the mirror site, http://www.cbil.upenn.edu/mgs/systems/c onsfreq/.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ساختار مولکول DNA سه رشته ای: اهمیت و کاربردهای پزشکی آن

Back in 1957, when investigators produced a triple-stranded form of DNA while studying synthetic nucleic acids, few researchers paid much attention to the discovery. However, triplex DNA was never entirely forgotton and especially since 1987 its structural and functional importance in biological systems as well as its medical applications and therapeutic potentional have been extensively studie...

متن کامل

Targeting of the HIV-1 long terminal repeat with chromomycin potentiates the inhibitory effects of a triplex-forming oligonucleotide on Sp1-DNA interactions and in vitro transcription.

We have studied the effects of chromomycin and of a triple-helix-forming oligonucleotide (TFO) that recognizes Sp1 binding sites on protein-DNA interactions and HIV-1 transcription. Molecular interactions between chromomycin, the Sp1 TFO and target DNA sequences were studied by gel retardation, triplex affinity capture using streptavidin-coated magnetic beads and biosensor technology. We also d...

متن کامل

Molecular differentiation of sheep and cattle isolates of Fasciola hepatica using RAPD-PCR

Understanding genetic structure and status of genetic variation of Fasciola hepatica isolates from different hosts, has important implications on epidemiology and effective control of fasciolosis. Random amplified polymorphic DNA (RAPD-PCR) was used to study the genetic variation of F. hepatica in sheep and cattle. DNA was extracted from adult helminthes removed from livers of each infected ani...

متن کامل

Comparative assessment of plasmid and oligonucleotide DNA substrates in measurement of in vitro base excision repair activity

Mammalian base excision repair (BER) is mediated through at least two subpathways designated 'single-nucleotide' (SN) and 'long-patch' (LP) BER (2-nucleotides long/more repair patch). Two forms of DNA substrate are generally used for in vitro BER assays: oligonucleotide- and plasmid-based. For plasmid-based BER assays, the availability of large quantities of substrate DNA with a specific lesion...

متن کامل

Similarity of position frequency matrices for transcription factor binding sites

MOTIVATION Transcription-factor binding sites (TFBS) in promoter sequences of higher eukaryotes are commonly modeled using position frequency matrices (PFM). The ability to compare PFMs representing binding sites is especially important for de novo sequence motif discovery, where it is desirable to compare putative matrices to one another and to known matrices. RESULTS We describe a PFM simil...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 15 7-8  شماره 

صفحات  -

تاریخ انتشار 1999